AITopics | text classification problem

Collaborating Authors

text classification problem

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Mitigating Boundary Ambiguity and Inherent Bias for Text Classification in the Era of Large Language Models

Lu, Zhenyi, Tian, Jie, Wei, Wei, Qu, Xiaoye, Cheng, Yu, xie, Wenfeng, Chen, Dangyang

arXiv.org Artificial IntelligenceJun-11-2024

Text classification is a crucial task encountered frequently in practical scenarios, yet it is still under-explored in the era of large language models (LLMs). This study shows that LLMs are vulnerable to changes in the number and arrangement of options in text classification. Our extensive empirical analyses reveal that the key bottleneck arises from ambiguous decision boundaries and inherent biases towards specific tokens and positions. To mitigate these issues, we make the first attempt and propose a novel two-stage classification framework for LLMs. Our approach is grounded in the empirical observation that pairwise comparisons can effectively alleviate boundary ambiguity and inherent bias. Specifically, we begin with a self-reduction technique to efficiently narrow down numerous options, which contributes to reduced decision space and a faster comparison process. Subsequently, pairwise contrastive comparisons are employed in a chain-of-thought manner to draw out nuances and distinguish confusable options, thus refining the ambiguous decision boundary. Extensive experiments on four datasets (Banking77, HWU64, LIU54, and Clinic150) verify the effectiveness of our framework. Furthermore, benefitting from our framework, various LLMs can achieve consistent improvements. Our code and data are available in \url{https://github.com/Chuge0335/PC-CoT}.

classification, classification problem, llm, (16 more...)

arXiv.org Artificial Intelligence

2406.07001

Country:

Asia > Singapore (0.04)
North America > United States > California > San Francisco County > San Francisco (0.04)
North America > Canada > Ontario > Toronto (0.04)
(2 more...)

Genre: Research Report > New Finding (0.93)

Industry: Banking & Finance (0.93)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.50)

Add feedback

Text classification problems via BERT embedding method and graph convolutional neural network

Tran, Loc Hoang, Tran, Tuan, Mai, An

arXiv.org Machine LearningNov-30-2021

This paper presents the novel way combining the BERT embedding method and the graph convolutional neural network. This combination is employed to solve the text classification problem. Initially, we apply the BERT embedding method to the texts (in the BBC news dataset and the IMDB movie reviews dataset) in order to transform all the texts to numerical vector. Then, the graph convolutional neural network will be applied to these numerical vectors to classify these texts into their ap-propriate classes/labels. Experiments show that the performance of the graph convolutional neural network model is better than the perfor-mances of the combination of the BERT embedding method with clas-sical machine learning models.

bert, graph convolutional neural network, text classification problem

arXiv.org Machine Learning

2111.15379

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Text Preprocessing Methods for Deep Learning - DZone AI

#artificialintelligenceOct-25-2021, 14:31:31 GMT

Deep Learning, particularly Natural Language Processing (NLP), has been gathering a huge interest nowadays. Some time ago, there was an NLP competition on Kaggle called Quora Question insincerity challenge. The competition is a text classification problem and it becomes easier to understand after working through the competition, as well as by going through the invaluable kernels put up by the Kaggle experts. First, let's start by explaining a little more about the text classification problem in the competition. Text classification is a common task in natural language processing, which transforms a sequence of a text of indefinite length into a category of text.

deep learning, text data, vector, (14 more...)

#artificialintelligence

Country: North America > United States (0.05)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

How I achieved 90% accuracy on a text classification problem with ZERO preprocessing

#artificialintelligenceMar-27-2021, 17:45:22 GMT

I chose to use the AG news benchmark dataset. I recuperated the training and test test from John Snow Labs (a must see reference for all things NLP). This dataset is divided into four balanced categories for a total of 120,000 rows as seen below. The dataset is formatted into 2 columns, category and description. Because I want this to be a succinct post, I will refer you to my previous article to find out how to use Spark NLP in Colab.

accuracy, bert sentence, text classification problem, (3 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Classification (0.40)
Information Technology > Artificial Intelligence > Machine Learning (0.40)

Add feedback

A cost-reducing partial labeling estimator in text classification problem

Chen, Jiangning, Dai, Zhibo, Duan, Juntao, Hu, Qianli, Li, Ruilin, Matzinger, Heinrich, Popescu, Ionel, Zhai, Haoyan

arXiv.org Machine LearningJun-9-2019

We propose a new approach to address the text classification problems when learning with partial labels is beneficial. Instead of offering each training sample a set of candidate labels, we assign negative-oriented labels to the ambiguous training examples if they are unlikely fall into certain classes. We construct our new maximum likelihood estimators with self-correction property, and prove that under some conditions, our estimators converge faster. Also we discuss the advantages of applying one of our estimator to a fully supervised learning problem. The proposed method has potential applicability in many areas, such as crowdsourcing, natural language processing and medical image analysis.

machine learning, natural language, text classification, (17 more...)

arXiv.org Machine Learning

1906.03768

Country: North America > United States > New York (0.04)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Classification (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)
(2 more...)

Add feedback

Sentiment Analysis: Overview, Applications and Benefits

#artificialintelligenceAug-10-2017, 20:40:16 GMT

Mining such data to determine how people feel about your product, brand, or service, is called Sentiment Analysis. When applied to social media channels, it can be used to identify spikes in sentiment, thereby allowing you to identify potential product advocates or social media influencers. Companies such as Microsoft, IBM and smaller emerging companies offer REST APIs that integrate easily with your existing software applications. For example, using the following publicly available Sentiment Analysis REST API from a small start-up called Social Opinion, we pass in the text, "this phone is awesome", to the following URL: In the response, we can see the text has been identified as expressing positive emotion, with a 64% probability of that being true.

artificial intelligence, information extraction, information technology software, (18 more...)

#artificialintelligence

Industry: Information Technology > Software (0.50)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.97)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.97)

Add feedback